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Abstract 

This paper explores the mixing time of the random transposition walk 
on permutations with one-sided interval restrictions. In particular, we're 
interested in the notion of cutoff, a phenomenon which occurs when mixing 
occurs in a window of order smaller than the mixing time. One of the main 
tools of the paper is the diagonalization obtained by Hanlon J}; the use 
of the spectral information is inspired by the famous paper of Diaconis 
and Shahshahani on the mixing time of the random transposition walk 
on S„ |3]. The diagonalization allows us to prove chi-squared cutoff for 
a broad class of one-sided restriction matrices. Furthermore, under an 
extra condition, the walk also undergoes total variation cutoff. Finally, a 
large collection of examples which undergo chi-squared cutoff but in which 
total variation mixing occurs substantially earlier and without cutoff is 
produced. These results resolve a conjecture of Diaconis and Hanlon from 
Section 5 of Hanlon's paper. 

1 Introduction 

This paper studies the mixing time of the random transposition walk on per- 
mutations with one-sided interval restrictions. Intuitively, the mixing time of a 
random walk is the number of steps it takes a random walk to get close to ran- 
dom. Permutations with restricted positions are elements a of the symmetric 
group Sn such that for each i, a{i) is only allowed to be in a particular subset 
of {1, 2, . . . , n}. These are specified by a {0, l}-matrix M, where a{i) is allowed 
to be equal to j precisely if M{i,j) = 1. Such a matrix M is called a restriction 
matrix. 

Let Sm be the set of permutations corresponding to M . More concisely, 

Sm = {cr I M{i, a{i)) = 1 for aU i} (1.1) 

Define S{i) = {j \ M{i,j) = 1}. Under this definition, S{i) is the set of allowable 
values for a{i). If S{i) is an interval for each i, then M is called an interval 
restriction matrix. If M isnxn, and S{i) ~ [a^, n\ for each i for some a,;, then M 
is a one-sided interval restriction matrix. For such matrices M, the permutations 
in Sm correspond to rook placements on a Ferrers board, an object obtained 
by removing a Ferrers diagram from one of the corners of an n x n chessboard. 



These objects have an elegant combinatorial structure, originally studied by 
Foata, and Schiitzenberger [5], and then later by Goldman, Joichi and White 
0. In particular, it is possible to perfectly sample elements of Sm- 

The random transposition walk on permutations with one-sided interval re- 
strictions proceeds as follows. First, pick a number i from {l,2,...,n} with 
your left hand, and an independent number j from {1, 2, . . . ,n} with your right 
hand. Note that it is is possible to choose the same number. If the cards in 
positions i and j can be transposed while remaining in the set Sm, do so. If 
this is not possible, go back and pick a different pair i' and j' using the same 
procedure. Continue selecting pairs until arriving at one that can be exchanged 
without leaving S]\{. Transposing this pair will then be the next step in the 
chain. 

This random walk was diagonalized by Hanlon [7] , who derived all the eigen- 
values and eigenvectors in terms of combinatorial objects described in Definition 
15.41 This amazing result is an extension of the diagonalization of the random 
transposition walk on Sm which uses the representation theory of the symmetric 
group (see James and Kerber [8J for a treatment of the standard theory). The 
diagonalization of the walk on Sm was possible despite the absence of a group 
structure. In this paper, the spectral information is used to obtain very sharp 
results on the mixing time of the chain under further restriction to 'two-step' 
one-sided restriction matrices, where each S{i) is either equal to [l,n\ or [a,n\ 
for some fixed a. This subclass of one-sided restriction matrices was selected 
because they generate a vertex transitive random walk and as such require only 
eigenvalue and not eigenvector information to examine the mixing time; this 
result is stated precisely in Corollarv l2.2l 

Any discussion of mixing time requires a notion of distance: a random walk is 
close to random when the distance between the distribution of the random walk 
at time t and the stationary distribution tt is small. A common distance is the 
total variation distance ||/Lt— i/||tv, which is defined to be the distance between 
jU and v. This distance has the advantage of measuring an intuitive quantity: 
it is the maximum error that can be made by estimating the probability viA) 
with n{A). However, this distance is not amenable to spectral analysis. As 
such, a second notion of distance is necessary. The appropriate distance is the 
chi-squared distance, which is defined to be the distance between ^ and 1 
(see Equation (|2.ip ). It is easy to show that the chi-squared distance is an upper 
bound on the total variation distance. In many examples, the chi-squared and 
total variation mixing times match, making the chi-squared distance an excellent 
tool when spectral information is available. 

The advantage of having complete spectral information is that it can be 
used to obtain very precise bounds on chi-squared distance at any time t. In 
particular, it can be used to show that the random walk experiences cutoff: a 
phenomenon which occurs when the walk transitions from being barely mixed 
to being thoroughly mixed in a window much smaller than the mixing time (see 
Equation (|2.2p for the precise definition) . The notion of cutoff can be applied to 
any distance; accordingly, this paper will examine both total variation and chi- 
squared cutoff. The present approach is inspired by the results of Diaconis and 
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Shahshahani, who used the eigenvalues of the random transposition walk on Sn 
to show total variation cutoff of the walk at time ^nlogn with a window of size 
n [3]. This paper makes use of the techniques of this proof, especially the more 
streamlined version presented by Diaconis in the book "Group representations in 
probability and statistics" [2J. However, as will be seen below, the calculations 
in this paper are much more involved. 

The main results of this paper are as follows: for two-step restriction ma- 
trices, there is a broad class of matrices for which the random transposition 
walk experiences chi-squared cutoff. This result is stated precisely in Theo- 
rem 12.121 However, the total variation mixing time is more complicated: an 
extra condition is needed to ensure total variation cutoff (see Theorem I2.13p . 
and these cases require an additional combinatorial argument. Furthermore, 
in Theorem 12. 14[ a class of examples is given for which total variation mixing 
occurs considerably earlier, and for which cutoff does not occur. In particular, 
these results resolve the conjecture stated as Theorem 5.6 in Hanlon's paper. 
Using the definitions above, the conjecture states that if 



S{^) 



[l,n] i < n" 

[2, n] n" < i < n 



then the random walk has cutoff. (There's a slight typo in the paper: it 
states that cutoff occurs around time ^n^~°' log n, whereas it should actually 
be jn^~"logn.) This turns out to be partially correct: while the statement is 
true for chi-squared mixing time, it does not hold total variation mixing time. 
Indeed, the total variation mixing time occurs substantially earlier and without 
cutoff. 

Therefore, the results in this paper are instructive in a number of ways. In 
a subset of cases, they show the utility of spectral information in showing cutoff 
with respect to both chi-squared and total variation distance. However, they 
also show that there is a broad class of random walks for which the spectral 
information only contains information about chi-squared distance, since total 
variation mixing occurs substantially earlier. Furthermore, in the latter case, it 
is demonstrated that the behavior of the chi-squared distance can be entirely 
different from the behavior of total variation distance, since in the examples pro- 
vided, chi-squared distance undergoes cutoff while the total variation distance 
does not. 

This paper also raises a number of fascinating questions about the random 
transposition walk on permutations with one-sided interval restriction. While 
this paper obtains strong results for a broad class of two-step restriction ma- 
trices, I am confident that some of the hypotheses in Theorems 12.121 and 12.131 
are extraneous. Furthermore, while it was convenient to restrict attention to 
the two-step case in order to simplify calculations by using only eigenvalue in- 
formation, Hanlon's paper also derives the eigenvectors of the random walk. 
Therefore, while the calculations would likely be more challenging, it should 
be possible to apply the ideas below to a broader class of one-sided interval 
restriction matrices. It would also be very interesting to find conditions on one- 
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sided restriction matrices which would guarantee that the chi-squared and total 
variation mixing time are of the same order. For a full discussion of the many 
questions which arise naturally from this paper, see Section [TUl 

2 Definitions and Setup 

There are a number of definitions needed before the results can be fully stated. 
Let jjL and v are two probability distributions on a finite state space Vt. Then, 
the total variation distance between and v is defined as 



For a Markov chain with transition probabilities y) and stationary distri- 
bution TT, define d{t) — •) — 7r||yy to be the distance from stationarity at 
time t. Then, the mixing time is defined as 



Conventionally, Tmix is chosen to be Tniix(l/4). 

As noted above, this paper also uses a second notion of distance. Define the 
chi-squared distance between distributions /i and tt with respect to tt to be 



The chi-squared mixing time can be defined in a way which is entirely analogous 
to the definition of the total variation mixing time. 

As should be readily apparent, the chi-squared distance is the L'^{tt) distance 
between ^ and 1. Furthermore, it is easy to see that — 7r||j,y is precisely half 
the L^{Tr) distance between ^ and 1. This means that chi-squared distance is 
an upper bound for twice the total variation distance. Furthermore, chi-squared 
distance turns out to precisely computable if all the eigenvalues and eigenvectors 
of the matrix P are known. The following two results are standard, and follow 
from Lemma 12.16 in [S]. 

Theorem 2.1. Let P be the transition matrix of an irreducible reversible Markov 
chain on f2 with eigenvalues 1 — f5o > /3i > ••• >/3|n|_i > —1, and with a cor- 
responding basis of orthonormal eigenvectors {vj}. Then, 



The above theorem requires all the information about the eigenvalues and 
eigenvectors to compute the chi-squared distance. However, given a sufBcient 




min{i I d{t) < e} 




(2.1) 
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degree of symmetry, eigenvalues suffice. Say that a Markov chain on a state 
space fl is vertex transitive if for every x,y G fi, there exists a bijection / : 
J7 — >■ ri such that f{x) = y, and / preserves the random walk. In this case, the 
following easy corollary can be proved. 



Corollary 2.2. // the assumptions from Theorem 
chain is also vertex transitive, then 



hold, and the Markov 



\P\x,-) -ttI 



TV 



|a|-i 



i=l 



Finally, it is necessary to define the cutoff phenomenon rigorously. This 
definition can apply to any notion of mixing time; in this paper, it will be 
applied both to total variation and chi-squared mixing times. Consider a family 
of Markov chains {X")t>Q with mixing times T^^^{e). Then, cutoff is present if 
for all e > 0. 



lim 



= 1 



(2.2) 



r". d - e) 

This is clearly equivalent to saying that for sufficiently large n, mixing occurs 
in an arbitrarily small window relative to the size of t^[^. 

Now proceed to definitions specific to one-sided interval restriction matrices. 
Let M be such a matrix; then for each i S{i) must be equal to [bi,n] for some 
bi- To simplify notation, make the following definition. 

Definition 2.3. Let n be an integer, and let b = (&i, 62, • • • , b„) be a vector of 
elements of {1, 2, . . . , n}. Define M{b) to be the nxn interval restriction matrix 
satisfying S{i) — [bi, n] for all i. 

Here is an example: if n = 4 and 6 = (I, f , 2, 3), then 



M{b) 



fill 
fill 
f I I 
I I 



Remark 2.4. The notation here is different from Hanlon's - he doesn't explic- 
itly talk about restriction matrices and uses the notation Rn{b) for the set we 
have called Sj^^fj^y 

Lemma 2.5. Let M be a restriction matrix, and let N be M with its rows 
permuted. Then there is an isomorphism between the random transposition walk 
on N and the random transposition walk on M . 

Proof: This result is straightforward (and is simply assumed in Hanlon [7].) 
For a full discussion, see Lemma 3.1 in my thesis [I]. □ 
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Lemma 12.51 shows that permuting the rows of M doesn't change the under- 
lying graph. Therefore, from now on assume that the vector b satisfies 

61 < 62 < • • • < 6„ (2.3) 

With the above assumption, Proposition 2.1 (d) in Hanlon shows that: 

Lemma 2.6. If bi <i for all i then the graph on Sj^.^^^ induced by the random 
transposition walk is regular, with degree 

n 

A = ^(z - 6.) (2.4) 

i=l 

If bi > i for some i, the set Sj^jfj^^ is empty. 

With the random transposition walk as defined in the introduction, make 
the following definition: 

Definition 2.7. Given a vector b = (61, &2, • ■ • , bn), define U{b) to be the adja- 
cency matrix of Sj^f^;-^ under the random transposition walk. 



With A from Equation 12. 4[ if P is the transition matrix for the random 
walk, then for r and a that differ by a transposition (i, j), 

P(a,r) = 



+ 2A 

since i can be chosen with the right hand and j with the left hand, or vice versa. 
Furthermore, 

^ ' n + 2A 

since the same i can be chosen with both hands. This shows that 

P=-^(n/ + 2f/) (2.5) 

where / is the identity matrix. This demonstrates that finding the spectral 
information for P reduces to diagonalizing U. Correspondingly, many of the 
theorems in this paper will be stated in terms of U{b). 

As noted earlier, attention here is restricted to a subclass of one-sided re- 
striction matrices: 

Definition 2.8. Let f{n),g{n) be two functions from Ti^ toJ/^. Definebn{f, g) 
to be the vector (61, 62, . . . , 6„) such that 



b, = 



1 I < f{n) 

g{n) 4- 1 f{n) < i < n 



That is, bni f, g) = (1, 1, . . . , 1, g{n) -f 1, . . . , g{n) + 1) where the number of Is in 
the beginning of the vector is f{n). 
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Remark 2.9. From Lemma bt < i for all i is necessary for S'^j^g) not to be 
empty. This condition at i — g{n) + 1 above forces 



f{n) > gin) 



(2.6) 



to a have a non-empty walk. Thus, from here on assume the above inequality 
for functions / and g for all n. 

Now, let b — bn{f, g) as defined above, and consider the matrix M(h). By the 
above definitions, S[i] is [1, n] if i is between 1 and f{n), while S[i\ is [g{n) + l, n] 
for i between f{n) + 1 and n. This means that a G Sj^j^j^^ has no restrictions 
on the first f{n) rows, and must be at least g{n) + 1 on rows that are at least 
f{n) + 1. Alternatively, on the first g{n) columns, a is only allowed to take 
values up to f{n). For example, if n = 5, /(5) = 3 and g{5) = 2, then 



65(/,g) = (1,1,1,3,3) 



(2.7) 



and 



M{b) 



11111 
11111 
11111 
111 
111 



The next lemma calculates the degree A of a vertex in Sm for above matrices 

M. 

Lemma 2.10. Let bn{f,g) be defined as in Definition \2.8\ above, and let M„ = 
M{bn{f,g)). Then, if A is defined as in Equation (|2.4I1 above, 



n'^ — n — 2ng(n) + f{n)g{n) 



Proof: From the definition of bi, 



A = ^(*-M = 



n{n + 1) 



^6, 



n{n + 1) 



-f{n)-l-{n-f{n))-{g{n) + l) 



— n — 2ng{n) + f{n)g{n) 



as desired. □ 
Notation 2.11. From now on, when / and g are implied, the convention 

M„ = M(6„(/,5)) (2.8) 

is used to simplify notation. 
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The first theorem below shows that, given certain assumptions on f{n) and 
g(n), the walk always achieve cutoff in chi-squared distance. 

Theorem 2.12. Let bn = bn{f,g) for some functions f and g which satisfy 
fin) > g{n) for all n, and which also satisfy 

/("-) 

lim — and lim f{n) — oo 

n— ^oo n n^QO 

and consider the random transposition walk on Sm„ ■ Let A denote the degree 
of the graph induced on Sm; hence, from Eauation \2.10\ above, n + 2 A = — 
2ng{n) + f{n)g{n). Then, there is chi-squared cutoff around 

(n + 2A)(log/(n)+logg(n)) 
4/(n) 

with a window of size ^^j^- More precisely, 
1- If 

(n + 2A)(log/(n)+log,9(n)) , n + 2A 

iTN ^'^W ^'-'^ 



then 

II Ptr-r ."1 _ ttI, 

l2,ir 



P\x,-)-t:\\ <4e"§ 



for c > 10 and n sufficiently large. 
2. Furthermore, if c > and 

(n + 2A)(log/(n) + logg(n)) n + 2A 

' = iTN ^'-'"^ 

then 

||P*(a;,-) -ttII > 

II V ' / Il2,ir — 2 

for n sufficiently large. 

As noted earlier, there are also interesting results for the more intuitive total 
variation distance. The following theorem states assumptions on / and g which 
imply total variation cutoff for the walk, while the theorem after it presents 
a class of examples for which total variation mixing occurs significantly before 
chi-squared mixing, and in which total variation cutoff does not occur. 

Theorem 2.13. Assume that f and g satisfy the hypotheses for Theorem \2.12\ 
above, and furthermore, that liminf„^oo > r > 0. Let AI„ = M{bn{f,g)) 
as in Equation (|2.8p above. Then, if 

^_ (n + 2A)(logff(n)+log/(n)) n + 2A 

4/(n) Uf{n) ^'■^^> 

then the random transposition walk on Sm„ satisfies 

liminf ||P*(a;, •) - 7r|| > - - e"''*'" 
and thus the walk hasn't mixed by time t for sufficiently large c. 
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Theorem 2.14. Let g{n) = 1, and assume that 

Ti 

lim f{n) — oo and f{n) < — for sufficiently large n. 

Then, the random transposition walk on Sm„ mixes in total variation distance 
in order "^^^ time. Furthermore, the walk does not have total variation cutoff. 

Remark 2.15. Hanlon's conjecture in his Theorem 5.6 concerns the case where 
g{n) = 1 and f{n) = n". As such, it is one of the many examples covered by 
the theorems above. After a httle simphfication, Theorem 12.121 states that this 
walk has cutoff around time jn^^" log(7i) with a window of size n^^". It is also 
clear that this case is one of the examples covered by Theorem 12. 141 Therefore, 
total variation mixing occurs in order time, and occurs without cutoff. 

Thus, the above results resolve the conjecture. 

The next section gives heuristics and proves Theorem 12. 131 while Section |4] 
proves Theorem 12.141 ^^nd also proves that the walk is vertex transitive. The 
chi-squared results are then proved. Section [5] reviews the necessary background 
material, restates Hanlon's eigenvalue results and provides examples of how they 
apply for the specific matrices M„ = M{bn{f, g)) under consideration. Section 
[6] provides bounds on the eigenvalues of the transition matrix P, which will be 
used in the following section to do lead term analysis and prove the lower bound 
in Theorem 12. 121 Finally, Sections [5] and [5] prove the upper bound in Theorem 

3 Heuristics and Total Variation Lower Bound 

This section gives heuristics and proves Theorem l2.13l First, here is an argument 
explaining why a total variation cutoff is expected around 

^_ (n + 2A)(log/(n)+logff(n)) 
4/(n) 

if f{n) and g{n) are of the same order, as the conjunction of Theorems 12.121 and 
12.131 implies. 

Consider the instructive case f{n) — g{n). For example, if n = 5 and 
f{n) — g{n) — 2, the restriction matrix is 



1 


1 


1 


1 


1 


1 


1 


1 


1 


1 








1 


1 


1 








1 


1 


1 








1 


1 


1 



In this case, a G S]\i can take values up to f{n) on the first f{n) columns. This 
makes it easy to see that a G Sm can be represented as a pair of permutations, 
one on {1,2,..., f{n)} and the other on {f{n) + 1, f{n) + 2, . . . , n}. Thus, 
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Now consider how fast the walk mixes. It has been shown by Diaconis and 
Shahshahani that the transposition walk on Sk has cutoff at time i/clogfc with 
a window of size fc |3] . At each step, the present walk uses either a transposition 

in >S'^(n) (with probability ;^q^^) or a transposition in ^^^^(n) (with probability 
^^n+2A^ ). This is equivalent to two separate walks: the walk on <S'/(„), slowed 
down by a factor of j^^, and the walk on Sn-f{n): slowed down by a factor of 
(n^ftn))'^ ■ Thus, the walk on <S'/(„) has cutoff around time 

, 1.. ^, ft ^ ri + 2A (n + 2A)log/(n) 
h = 2/(n)log/(n) . = 

with a window of size f{n)-j^~^ — "^^^ , while the walk on Sn-f{n) analogously 
has cutoff around time 

_ (n + 2A)log(n-/(n)) 
2(n -/(«)) 

with a window of size ^-f^) ■ ^^^^ (and is proved below in Lemma 

17. ip . that if f{n) is comparitively small with respect to n, then ti is the larger 
of the two times. Since in this case, /(n) = g{n), ti gives precisely the answer of 
Theorems 12.121 and 12.131 The above analysis shows that the first gin) columns 
are the limiting component of the walk. Theorem 12.121 assumes that — > 
to simplify computations. 

Remark 3.1. This discussion shows why f{n) ^ c» is assumed for the main 
theorems - in order to reasonably talk about asymptotics, the limiting term 
should at least go to oo. 

Before proving Theorem I2.13[ a supporting lemma calculating the size of 
SM(a) is needed. 

Lemma 3.2. // a = (1, 1, . . . , 1, y + 1, . . . , y + 1) is a vector of length n, where 
the number of initial Is is precisely x, and x >y, then 

I ^_x\{n~y)\ 

Proof: Count the number of elements in SM(a) column by column. How many 
choices are there for a^^{\)l Since a(i) is only allowed to be 1 for z < x, there 
are x choices for the first column. Similarly, there are {x — 1) choices for a~^(2), 
and continuing, there are (x — i + 1) choices for a~^(i) for i < y. Thus, the 
number of choices for the tuple (a^^(l), a~^(2), . . . , a^^{y)) is precisely 

x\ 

X ■ {x — 1) ■ ■ ■ {x — y + I) 



(x-yy. 



Now, consider the number of choices for a ^{y + 1). Since a{i) is allowed to 
be equal to y + 1 for every single i < n, and furthermore, a^^(l), • • • ,a^^(j/) 
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are already specifiied, there are precisely n — y choices for a~^{y + 1), n — y ~ 1 
choices for a'^{y + 2), etc. Thus, the total number of choices for these is 
precisely {n — y) ■ {n — y — 1) ■ ■ ■ 2 ■ 1 — {n~ y)\. Multiplying these together gives 

as required. □ 

Theorem 12.131 gives a lower bound on the mixing time. Here, an explicit set 
A is found such that for t in the above theorem, |P*(i(i, A) — 7r(A)| is large. In 
their paper, Diaconis and Shahshahani use the set of permutations with at least 
1 fixed point [3J. The heuristics above suggest trying a modification: clearly, 
if f{n) = g{n), the fixed points in the top f{n) x f{n) square of Af„ could be 
used. In general, the number of fixed points in the first g{n) columns works. 

Before the proof, some notation mainly used for the next two sections is 
needed. Denote the Markov chain by (^fe)^o' where 

Xk = {Xk{l),Xk{2),...,Xkin)) 

and let the random row transposition used at time k be r^. Clearly, Xk = 
Xorir2 ...rk- 

Proof of Theorem I2.13t It must be shown that at time 

^_ (n + 2A)(log.g(n)+log/(n)) n + 2A 

4/(n) Uf{n) ^'-'^ 

the walk on Sm„ has not yet mixed, where M„ — M(bn{f, g))- By assumption, 

liminf = ^ > 0. 

/(") 



n— >oo 



This implies that limsup„_^o^(log /(n) — logg{n)) < — logr. Thus, for suffi- 
ciently large n, 

^ (n + 2A)logg(n) , , ,n + 2A 
2f{n) 4/(n) 

Hence it suffices to show that the walk has not mixed at the time on the right- 
hand side above. Call this time t' . 
Define An as 

An — {(7 Sm„ such that a{i) — i for at least one i < g{n)} 

Call i < g{n) such that a{i) — i a small fixed point. The first step calculates an 
upper bound for 7r(A„) for the uniform distribution tt on Sm„- This is done by 
a standard inclusion-exclusion argument. Consider the number of ct G Sm„ such 
that a{i) = i for a specified value of i < g{n). In the language of permutation 
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matrices, to pick such a cr, mark the 1 in position cross out the ith row 

and ith column, and select the values of a on the remaining rows and columns. 
It is easy to check that this corresponds to picking an element in S'M(a)j for 
a = (1,1,...,!, g{n), . . . , g{n)) where a starts with f{n) — 1 ones and the vector 
is of length n — 1. Thus, from Lemma 13.21 above. 



1/ ^c, I r^ -ii |c I (/N-i)!(^-.gW)! 

\{a G Sm„ I ai^) = ^}\ = \SMi,)\ = 
From the same lemma, Sm = '^/f / ' . Thus, 

\{a e S'm„ I cr(i) i}l = ~JJ~j'- 

In a similar way, given k distinct values of 11,12, ■■■ ,ik in choose a 

satisfying = ij for all j by crossing out all the rows ij and columns ij. 

This would correspond to the vector a' = (1, 1, ... , 1, g{n) + l—k, . . . , g{n)+l — k) 
of length n ~ j, and thus 

IS ^ Q If-, r\ ■ XI (/(")- ^)K»^-5("))! 



\Sm 



{f{n)-g{n))\ 
^'(/(n)-fc)! 



Picking an element cr that has precisely k small fixed points involves choosing 
the values zi, ^2, . . . , u- above. Thus, 

fg(n)\ f{n)\ 
\{a e Sm„ I cr has k small fixed points} | = I ' ) \Sm„' 



k J ' (/(n) -fc)! 

^ ,0 I g(") ■ ■ • (g(") - fc + 1) 

' I •••(/(«)- A; +1) 

Now, the standard inclusion-exclusion argument gives 

'gin) g{n){gin)-l) , g{n){gin) - l){g{n) - 2) 



\An\ - \S_ 



/(n) 2f{n){f{n) - 1) 6/(n)(/(n) - l)(/(n) - 2) 



From /(ri) — )> cxd, and liminf > 0, g{n) 00. Thus, the above sum becomes 
arbitrarily well-approximated by 



and thus 7r(A„) is arbitrarily well-approximated by 1 — e s{ri)/f{n)_ Therefore, 

g(") 
/(") 



noting that 1 — e ^ is an increasing function, and that Sip^ < \ 



limsup7r(A„) < 1 ~ -. 

e 
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Now, consider the probability of A„ after t' steps of the random walk, started 
at the identity. Define Ffe to be the number of small fixed points of our random 
walk at time k: that is, the number of rows i < g{n) such that Xk{i) = i. Con- 
sider the distribution of Ffe. Note that since the walk starts at the identity, any 
i < 9{n) that has not yet been tranposed with anything is in Ff. Furthermore, 
if Xi.{i) < g{n), row i can only be transposed with row i if j < /(n) - that is, a 
row corresponding to one of the first g{n) columns can only be transposed with 
one of the first f{n) rows. Otherwise, Xk+i{j) = Xk{i) < g{n), which is not 
allowed for j > /(n). 

From the arguments above, _Fo = {1. 2, . . . , g(n)}, and furthermore, row i 
can leave the set -Ffe_i only by being transposed with a row that is at most 
/(n). Consider two cases - first, the next step could transpose two rows which 
are both at most g{n). This has probability 

Pirk = {ij)\t,J<9{n))= ^^""^^ 



n + 2A 

Secondly, the next step could transpose a pair of rows one of which is below 
g{n) and the other one is between g{n) and f{n). This has probability 

P {Tk = {ij) I i < 9{n) < 3 < f{n) or vice versa) = ^9in){nn)^-^g{n)) ^ 

The argument proceeds by estimating the probability of having transposed 
each element from {1,2, . . . ,g{n)} at time t'. Note that if two rows below g{n) 
are transposed at time k, then they both leave the set Fk-i, whereas the second 
case above corresponds to only one row leaving the set. This rephrases the 
question as the following coupon collectors problem: at each step, collect two 
coupons with probability ^^_^2A coupon with probability _ 

What is the probability of not collecting every coupon from {1,2, g{n)} by 
time t'? 

Begin by calculating the mean number of coupons collected at each step, 
counting each coupon however many times it's collected. This number is clearly 

E (coupons collected in one step) = 2 • ^ + 1 • Mn){fin)- g{n)) 
^ ' n + 2A n + 2A 

_ 2/(n)g(n) 

n + 2A 

Similarly, the variance of the number of coupons collected in one step is bounded 
above by Thus, after t' total steps of the walk, the total number of 

coupons collected is concentrated around 

2/(n)g(n) _ / (n + 2A)logg(n) n + 2A \ 2j{n)g{n) 

n + 2A V 2/(n) ^^^'2j(n))' n + 2A 

= g{n)\o^g{n) - (c + log r)s'(n) 
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with a window of at most yj2g(n) \ogg{n). Since g{n) approaches oo, and since 
y^2g{n) \ogg{n) is o{g{n)), using standard coupon collector arguments [4j we 
can conclude that 



P* (id, An) ^1- 



1 - e" 



as n ^ oo. 
Therefore, 



lim inf 



P*' {id, •) - vr > lim inf f P*' (id, A„) - 7r(A„)') 

TV n— ^oo \ / 



> 1 - 



- ( 1-- 

e 



1 



the desired inequality for t' . As noted earlier, since for sufficiently large n, t < t' , 
and since distance to stationarity is non-decreasing, 

liminf \\P\id, ■) - A > - - e"''^'' 



and so the walk hasn't mixed by time t, as required. 



□ 



4 Vertex Transitivity and Example of Fast Mix- 
ing 

This section contains the proof of some of the total variation results, in particular 
Theorem 12. 141 Here, 17(71) = 1, which means that hn{f, g) = (1, . . . , 1, 2, . . . , 2). 
Hence a G Sm„ has the following restrictions: it's allowed to be at most /(n) on 
column 1, and has no restrictions at all on the remaining columns. It is easy to 
calculate that the probability of the first column being tranposed in a particular 
step is 

PfColumn 1 tranposed at step k) = '^^ ^ . — ~ '^^ 

n + 2A 

using the value for A derived in Lemma [2. 101 

Now, since f{n) < ^j"^^ for sufficiently large n, the first column gets tran- 
posed at most every |nlogn steps or so. Furthermore, the remaining n — 1 
columns have no restrictions, and hence the walk without the first column is 
just the walk on Sn~i- Since this walk has cutoff at time i(n — 1) log(n — 1), 
these n — 1 columns should be thoroughly mixed by the time the first column 
is used at all. Therefore, the walk should be mixed as soon as the first column 
is used. Furthermore, since mixing is driven by one column, a cutoff is not 
expected. 

In order to simplify calculations, it is first shown that for any b ~ t'„{f,g), 
^M{b) ^® vertex transitive. This will simplify the proof of Theorem 12.141 as the 
walk may be started at the identity. Furthermore, this will also be useful later 
for spectral analysis, as Corollary (|2.2p will be used. 
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Lemma 4.1. For Mn = M(bn{f, g)), the graph induced on Sm„ by the random 
transposition walk is vertex transitive. 

Note: This paper uses a different convention from Hanlon for permutation 
multiplication: if a and /3 are permutations, multiplication is treated as function 
composition. That is, 

(a/3)(*) 

Proof: Let tt and a be elements of Sm„ ■ A graph isomorphism : — S'j\/,^ 
is found such that 

0(7r) = a. 

Only graph isomorphisms of the form 

(t){T) = aT(3 (4.1) 

are considered, where a is a permutation in 5„ that only acts non-trivially on 
the elements {g{n) + l,g{n) + 2, . . . ,n} and /3 is an elements of Sn that only 
acts non-trivially on the elements {1, 2, . . . , /(n)}. The proof consists of the 
following steps. First, it is shown that any as defined in (|4.ip maps Sm^ to 
itself. Secondly, it is shown that any such is an isomorphism. Finally, a and 
j3 are found such that the cj) defined above satisfies 

0(7r) = a 

Step 1. Note that visually, a rotates the last n — g{n) columns of the restriction 
matrix, while (3 rotates the first f{n) rows of the restriction matrix; this formu- 
lation makes it clear that (f> must map Sm^ to itself. To prove this formally, use 
the definition 

Sm^ {t G Sn I T{i) > g{n) + 1 for aU i > f{n) + 1} . 
Thus, it must be shown that for i > f(n) + 1, 

> g{n) + 1. 

But since /3 only acts on {1, 2, . . . , /(n)}, for i > f{n) + 1, = i. Thus, 
aTj3{i) = aT{i). Since r £ Sm^i ^^^d i > f{n) + 1, r(i) > g{n) + 1. Since a only 
acts on {g{n) + 1, g{n) + 2, . . . , n}, ar(«) — a{T{i)) > g{n) + 1. Thus, 

Q!r/3(i) = aT{i) > g{n) + 1 

as desired. 

Step 2. It now must be shown that any (j) defined by (|4.ip is an isomorphism. 
By Step 1, (j) maps into S'a/,,, and it's obviously invertible, so it suffices to show 
that it preserves edges. That is, for any transposition (i,j) and t G Sm„, such 
that {i,j)T G Sm„, is a neighbor of </)(t). But 

J» = a{i,j)Tl3 = {a{i),a{j))aTl3 
= {aii),a{j))(j){T) 
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which is clearly a transposition away from (/'(r). Thus, the map 4> preserves 
edges, and hence is an isomorphism of 5m„ ■ 



Step 3. Finally, find a and j3 such that 



This entails aufi = a. Since (5 acts only on {1,2,..., /(«)}, this means that for 
i > f{n), 

a'Kp{i) = a{i) 
=J> a7r(i) = a(i) 

Thus, a must satisfy 

a7r(i) = a{i) for i > f{n) (4.2) 

This defines a on 7r(5), where S — {i > f{n)}. Since tt £ Sm^, for all 
i > /(n), 7r(i) > g(r7,). Thus, 7r(5') C {g{n) + l,g{n) + 2, . . . ,n}. Furthermore, 
for i > f{n),a{i) > g{n): therefore. Equation (|4.2p says that a must send a 
subset of {g{n) + 1, g{n) + 2, . . . , n} to some other subset of {g{n) + l,g{n) + 
2,...,n}. Therefore, it is possible to pick a permutation that acts only on 
{g{n) + l^gin) + 2, . . . ,«} and satisfies Equation (|4.2p : call this permutation 

Now, define /3o = Tr^^aQ^^CT. Clearly, with this definition, 

(j){Ti) = aoTT^o = cr 

To check that this /3o acts only on {1,2,..., /(n)}, it suffices to show this for 
Pq^. Let i > f{n): 

fiQ^{i) = a~^ao7r(i) = cr"^(cr(i)) = i 



where the second equality follows by Equation (|4.2[) . which was used to define 
ao- Thus, . and hence /3o, fixes {f{n) + 1, /(n) + 2, . . . , n}, so /3o acts only 
on {1, 2, . . . , /(n)}. Therefore, for this choice of ao and /3o, — (^qt/^o is an 
isomorphism of Sm^ which maps tt to cr, as required. □ 

Remark 4.2. Unfortunately, Sm is not vertex transitive for all one-sided inter- 
val restriction matrices. While I have not discovered an easy characterization 
for vertex transitivity, it is easy to provide counterexamples. For example, let 



M 



11111 
11111 
11111 
1111 
111 



Both cr = 12345 and r 
calculation shows that 



45123 are in Sm 
5207 



117649 



Furthermore, a simple computer 
5287 



117649 
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implying that the walk is not vertex transitive. I conjecture that Sm is not 
vertex transitive for almost all one-sided restriction matrices that are not 'two- 
step.' 

Return to proving Theorem 12.141 following the outline at the beginning of 
this section. 

Lemma 4.3. Let T be the first time the first column is used in the walk. Fur- 
thermore, fix e and assume that i is chosen such that at time i — 1, the random 
tranposition walk on Sn-i is within e in total variation distance from uniformity. 
Then, if n is the uniform distribution on for t>i, 

Thus, the conditional distribution of Xt given T = i is very close in total vari- 
ation to the uniform distribution. 



Proof: From Lemma [4.11 the walk is vertex transitive. Without loss of gen- 
erality, the walk starts at the identity. Now, at times t > i, the walk simply 
evolves as usual, since the event conditioned on happened already. Since the 
total variation distance of a Markov chain to stationarity is non-decreasing, for 
t > i, 

i J2 \P{Xt = x\T = i)-n{x)\<^ J2 \PiX^=x\T = ^)^^T{x)\ 

Clearly, the expression on the right-hand side above is the just total variation 
distance between Xi conditioned on T = i and stationarity. Thus, it is equal to 

sup \¥{XieB\T = i)-n{B)\ 

BCSa/„ 

and that is precisely what will be bounded. Accordingly, fix a subset B of Sm^- 
By definition, until time i the walk is precisely the random transposition walk 
on the permutations of {2,3,-- - ,n}, and hence it is identical to the random 
transposition walk on Sn-i- Recall that i is chosen such that this walk is within 
e of stationarity at time z — 1. Now, let 

S = {ae Sm„ I <y{l) = 1} = (4.3) 

and let fr is the uniform distribution on S. For any B C_ S, 

|P(X,_i e B \T = i)-n{B)\ < e. 

Now consider the transposition which takes to Xi. Conditioning on 
T = i, this transposition is uniformly distributed between all the transpositions 
involving the first row (as the walk starts at the identity, this is equivalent to 
using the first column.) Note that among these, the transposition (1,1) appears 
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once, while the remaining transpositions (1,A:) appear twice. This means that 
there are precisely 2/(n) — 1 possible choices for r^. Now, determines the 
value of Xi on column 1. Accordingly, write B — Bi L) B2 U ■ ■ ■ L) Bf(^n)-: where 

Bk^{ae B\a{k) = 1} 

This decomposes B into equivalence classes that depend on the value oi a & B 
on column 1. Since — 1, if = (1, j), then Xi{j) — 1. Therefore, Xj is 

in Bfc if and only if is (1, fc) - that is, if at time i rows 1 and k are transposed. 
Hence, 

nX^&Bk |r = i)=P(X,_i(l,fc)€Bfe |r = z)P(r, = (l,fc)) 
= P(X,_i G fc) I T = i)¥{n = (1, fc)) 

As noted earlier, there are precisely 2 fin) — 1 possibilities for r^, and the cases 
fc = 1 and fc 7^ 1 are slightly different. Consider those separately. For fc = 1, 

P(X, e Bi I T = i) = P(X,_i e Bi I T = i)¥{r, = (1, 1)) 



P(X,_i £ Bi |T 
2/(n) - 1 



< 



2/W 



(4.4) 



and for fc ^ 1, 



nx^ eBk\T = i)= P(X,_i e fc) I T ^ z)P(r, = (1, fc)) 

^ 2P(X,_i eBfc(i,fc) \T = i) 

2f{n) - 1 



(4.5) 



By choice of i, P(Xi_i e -Bfc(l, fc) | T = i) is well-approximated by 7r(i?fe(l, fc)), 
where tt is the uniform distribution on S as defined above in Equation (|4.3I) . It 
is easy to see that 7f(i?fe(l, fc)) = f{n)n{Bk(l,k)) = f{n)TT{Bk), and thus that 



|P(X,_i GBfe(l,fc) |r = z)-/(n)7r(Bfe)| <e 
for each fc. Therefore, for fc ^ 1, using Equation (|4.5I1 . 

2P(x,_i eBfe(i,fc) |r = z) 



|P(X, eBfc |r = z)-7r(5fe)| = 



-7r(Bfe) 



< 



< 



2/(n) - 1 

2|P(X,_i eBfc(l,fc) |T = z)-/(n)7r(Bfc)|+^(Bfc) 



2/(n) - 1 



2e + 7r(gfc 
2/(n) - ] 
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Combining the above with Equation (|4.4p . 

\F{X, e B \ T = i) - 7TiB)\ = 



f(n) 

fc=l 



^)-7^(i?,)) 
/(") 



k=2 



<P(X, e Bi \ T ^i)+^{Bi) + Y, 
< n{Bi)+€ + 



2e + ^(gfc) 
2f{n) - 1 



1 / ^^"^ \ 



Now 



, since Bi C S*, and 7r(S') is clearly j^, '"'(-Si) < 2/(n)-i • ^l^o, 
/(") 

^ ^(Bfc) < 7r(B) < 1 

fc=2 

Plugging these back into Equation (|4.6I) . 



|P(Xj e B |T = i) - Tr(B)| < 



and hence 



as required. 



2/(n) - 1 
4 



sup \P{X,eB\T^i)-niB)\ < 
bqSm„ ^J[n) - i 



□ 



Turn now to the proof of Theorem 12.141 As above, condition on the first 
time the first column is transposed. 

Proof of Theorem 12.141 Without loss of generality, assume that the walk 
starts at the identity. Define 

3(n + 2A) 



* 2/(n) - 1 ■ 

It must be shown that the walk is sufficiently mixed by time t. 

Let T is the first time that column 1 is used in a transposition. Then 



(4.7) 



|P*(zd, •) - n\\^^ = ^J2 \P\^d,x) - 7r{x)\ 



2 

X 



P{Xt = a; I T = i)F{T = i) - tt{x) 

oo oo 

P{Xt = a; 1 T = i)F{T = «) - E ^i^M^ = *) 



°° 1 

<^p(r = z)-^|p(Xi = x|r = z)-^(a 



(4- 
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Now, from the results of Diaconis and Shahshahani, the random transposi- 
tion walk on Sn-i has cutoff at time ^(n — 1) log(n — 1) with a window of size 
n — 1. Fix e > 0. If i > nlogn, then for sufficiently large n, the total variation 
distance between the random transposition walk on Sn-i at time i and the uni- 
form distribution is less than e. Thus, by Lemma [4.31 above, if ? > nlogn and 
n is sufficiently large, then for t > i, 

i J2 \P{Xt = x\T^i)-n{x)\<—^—+e (4.9) 



Now, using Equation ()4.8p . since the total variation distance between distribu- 
tions is always at most 1, \\P*{id, •) — 7r|| is bounded above by 

* 1 

P{T = i)-'Y\V{Xt^x \T ^i) -7r{x)\+F{T [n\ogn,t]) (4.10) 

i—n log n X 

The probability of transposing the first column in one step is precisely ^^^^a^ ■ 
Thus, 

P(T < nlogn) = 1 -P(T > nlogn) 

_2/(n)-l^">°*^" 



n + 2A 

Since /(n) < -^-^^ for sufficiently large n, and since n + 2A w n^, 
V n + 2A y 

for sufficiently large n. Similarly, the probability that T > t is just 

l-M!^y<expf-^M!^Ue- 
n + 2Ay- V n + 2A J 

and thusP(T ^ [nlogn,^]) < 1 - e~i/2_,_g-3 ^ o.45. Next, by Equations (|TTU)) 
and (|4.9p . for sufficiently large n, 

* 1 

||P*(zd,-) -7r||j,^ < ^ P(T = i)-^|P(at=a;|T = i)-7r(a;)|-|-0.45 

i^n log n X 



-n log n 



2/(n) - 1 

Since e can be chosen to be anything, and /(n) — ^ oo, the above is clearly less 
than i for sufficiently large n. Thus, the walk has mixed by time t = 2/(n)^^i ' 
which is clearly of order "^^^ . 
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Finally, the walk does not have cutoff: starting at the identity, 



P (^.(1) - 1) > F(T > fc) = (^1 - ' 

Under the uniform distribution tt, the probability of 5 = {cr G Sm„ \ 17(1) = 1} 
is precisely This means that 



TV - y n + 2A J f{n) 

It is easy to see that the above function falls off smoothly as opposed to ex- 
hibiting cutoff. For example, at time 2t ~ ^/(n)^^i ' approximately e~^, 
which does not approach as n — > oo. Thus, the walk does not have cutoff, 
completing the proof. □ 



5 Hanlon's Results and Other Preliminaries 



. , bn) be a sequence chosen from {1, 2, . . . , n} 
Call u and v left- equivalent if fe„ — by. Let 



The remainder of this paper focuses on proving Theorem I2.12[ which is con- 
cerned with chi-squared cutoff. This section contains a review of Hanlon's work 
and other background material, which shows how to diagonalize the adjacency 
matrix for Sj^^^j^y The following is Definition 4.2 from his paper [7]: 

Definition 5.1. Let b = (6i,. 
satisfying foi < 62 < • • • < • 
Li, L2, ■ ■ ■ , Lg denote the left- equivalence classes of {1, 2, . . . , n}, where the or- 
dering is chosen so that whenever i < j , the elements of Li are less than the 
elements of Lj . 

For notational simplicity, define fe„+i = n + 1, and call u and v right- 
equivalent if there exist i and i-\-l such that bi < u,v < It is straightforward 
to check that the number of left- equivalence classes is equal to the number of 
right- equivalence classes. Let _Ri, . . . , Rg denote the right- equivalence classes of 
{1, 2, . . . , n}, where the Ri are ordered in the same way as the Li above. 

Example 5.2. Left-equivalence and right-equivalence classes are very easy to 
visualize via the restriction matrix M{b). For example, let b = (1,1,1,2,4). 
Then, the corresponding M{b) is below: 



M{b) = 



11111 
11111 
11111 
1111 
1 1 



Now, imagine the restriction matrix as aii n x n chessboard, where each 
square contains either a or a 1. Shade in the squares that contain Is, and 
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look at the southwest boundary of the shaded area. This has a 'step pattern': 
in the above example, go down 3 steps, right 1 step, down 1 step, right 2 steps, 
down 1 step, right 2 steps. It is easy to check that the left-equivalence classes 
correspond to the down stretches of the step pattern, and the right-equivalence 
classes correspond to the stretches pointing right. 

Using this, for b as defined above, the left-equivalence classes are {1, 2, 3}, {4} 
and {5}, while the right-equivalence classes are {1}, {2, 3} and {4, 5}. (This visu- 
alization also explains why the numbers of left-equivalence and right-equivalence 
classes match.) 

Remark 5.3. It should be clear that if bn{f,g) is defined as in Definition 12. 8[ 
then Mn is a 'two-step' restriction matrix: that is, the southwest boundary 
described above will go down for f{n) steps, then will go right for g{n) steps, 
then will go down for n — f{n) steps, then will go right for n — g{n) steps. Thus, 

Li = {1, 2, . . . , f{n)}, L2 - {fin) + 1, . . . , n} (5.1) 

and 

i?i = {1, 2, . . . , g{n)}, L2 = {gin} + 1, . . . , n} (5.2) 

For example, if n = 5, /(5) = 3 and 5(5) = 2, then b = (1,1,1,3,3) as in 
Equation (|2.7p above. From the M(6) below it, it's clear that the left-equivalence 
classes are {1, 2, 3} and {4, 5}, while the right-equivalence classes are {1, 2} and 
{3, 4, 5} matching the expressions above. 

The following is Definition 4.3 from Hanlon [7]. 

Definition 5.4. A b-partition a — (Ai, /ii, A2, . . . , /^s-i, As) is a sequence of 
partitions such that 

1. Xi ^ ^ii K+i for all i = 1, 2, . . . , s — 1 

2. \Xi \ pLi\ = \Ri\ for a/H — 1, 2, . . . , s ( defining fi^ = ^) 

3. \ = l-^i+il for a/H = 0, 1, . . . , s — 1 (defining jiQ = %) 

It is helpful to have some standard terminology (a good reference for this 
is Stanley [TT|). Recall that a Ferrers diagram is a finite collection of boxes, 
arranged in left-justified rows, such that each row has at least as many boxes 
as the row directly below it. If (ai, 02, ... , a„i) is a partition, then the Ferrers 
diagram associated to it has precisely ai boxes in row i. For example, below is 
the Ferrers diagram of the partition (4, 2, 1) of 7: 
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The transpose of a partition A is a partition whose Ferrers diagram is 
a reflection of the Ferrers diagram of A along the main diagonal y = —x. For 
example, for A — (4, 2, 1), the tranpose partition A"^ is (3, 2, 1, 1). 

Ferrers diagrams give a good way of visualizing ^-partitions. If the left- 
equivalence classes are the sets Li, L2, ■ ■ ■ , Lg and the right-equivalence classes 
are the sets R2, . . . , i?^, then to get a ^-partition start with /ip = 0, add \Li\ 
squares to get to Ai, delete |i?i| squares to get /ii, add IL2I squares to get A2, 
etc. For example, ii b = (1, 1, 1, 3, 3), with left-equivalence classes {1, 2, 3} and 
{4,5}, and right-equivalence classes {1,2} and {3,4,5}, then the following is a 
6-partition: 



/xo =0, Ai = 



Ml 



M2 



A labeling of a Ferrers diagram is a Ferrers diagram with numbers filled into 
the boxes. A standard Young tableau is a labeling in which the entries of each 
row are strictly increasing from left to right, while the entries of each column 
are strictly increasing from top to bottom. A semi-standard Young tableau has 
strictly increasing columns, but weakly increasing (nondecreasing) rows. The 
following is a semi-standard Young tableau for the partition (4, 2, 1) of 7: 



1 


1 


2 


3 


2 


4 






3 









Definition 5.5. Let a and /3 be partitions satisfying ai > Pi for all i. Let 

n = \a\ — Then, the skew-shape a//3 is well-defined, and there exists a rep- 
resentation of Sn corresponding to a//3. Denote the degree of this representation 
by l^a/pl- 

Remark 5.6. It is well-known that |A'q,/^| is the number of standard Young 
tableaux of shape a//3 (a good source for this and results like it is Macdonald's 
book do]). 

Definition 5.7. Let a = (Ai, /ii, A2, . . . , As) be a b-partition. Define the indi- 
cator tableau T{a) to be the tableau whose entry in a square x is the number of 
skew shapes Xi/fii^i that contain x. Denote this entry by Tx{a). 

Now, recall that the content of a square x in a Ferrers diagram is denoted 
by Cx, and is defined to be j — j, where x lies in column j from the left, and 
row i from the top. The following theorem restates a main result of Hanlon's 
paper. Theorem 4.15, which derives all the eigenvalues of the adjacency matrix 
U using the language indicated above. 

Theorem 5.8. For every b-partition a = (Ai, /ii, A2, . . . , Ag), there exists an 
eigenspace i?„(a) of U{b) such that the vector space C!>5'^^fg^ decomposes as a 
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direct sum of the spaces i?„(a); that is, 

a 

where the sum runs over all b-partitions a. The dimension of Rn{a) is equal to 

s 
i=0 

where /xq = /is = 0- 

Furthermore, letting the eigenvalue of U corresponding to i?„(a) be A(a), 

{a)cx (5.4) 

X 

where the sum is over all the squares x in the indicator tableau T(a). 

The following lemma finds another expression for the eigenvalue A(q). It 
requires the following definition: 

Definition 5.9. If X is a partition, define C(A) to be the sum of the contents 
of all the squares of A. 

Lemma 5.10. If a — {Xi, fj,i, . . . , fj^^-i, Xs) is a b-partition, 

A{a) = C(Ai) - Cifii) + ■■■- C{fis-i) + C{X2) 

s s—1 

^Y^CiX,) -J2C{fi.) (5.5) 

1=1 1=1 

Proof: From Equation (15. 4[) . 

A (a) = y^r^ (a)ca; 

X 

where the sum is over all the squares x in the indicator tableau T{a). Recall 
that Tx{a) is the number of skew shapes Ai//ii_i that contain x. Thus, since 
C{po) = 0, 

i=l i=l 1=1 x£Xi 4=0 xG/^i 

= E E " E 

i—l xefii-i 
s s 

= E E "^^^ = E E cxi{x6A.M_i} 

i=l xe\i/iii^i 1=1 x£T(P) 

s 

" E c^E = E 

a:GT(/3) i=l a:GT(/3) 

= A(a) 
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as required. □ 
To diagoiialize the transition matrix P, make the following simple definition. 
Definition 5.11. Define 

Ai(q) = -V^ (5.6) 

^ ' n + 2A ^ ' 

From Equation (j2.5p . 

so Ai(q;) are the eigenvalues of P. 

6 Bounding the Eigenvalues 

Lemma |4 . II above shows that the graph on Sj^j^^^^ induced by the walk is vertex 
transitive. Thus, Corollary 12.21 and Theorem 15.81 combine to show that 



\\P\x, •) - ^11^^^ = ^^dim(i?„(a))Ai(a)2* (6.1) 

where the sum is over 6-partitions a such that Ai(a) ^ 1. Using the above 
expression calls for good bounds on the eigenvalues Ai(q;) and the dimensions 
of the eigennspaces Rn{a). This section concentrates on the eigenvalues. It is 
first shown that it suffices to consider non-negative eigenvalues Ai (a) by showing 
that the eigenvalues come in pairs. Since this is always true, it is proved for a 
general b: 

Lemma 6.1. Let a = {Xi, fii, X2, . . . , Xs) be a b-partition. Letting 

a^ = (Af,Mf,A^,...,Af) 

it can be concluded that 

dim(i?„(a^)) = dim(i?„(Q;)) and A(a^) = -A(a) (6.2) 
Proof: From Theorem [ 



dim(i?„(a^)) = nkAf/Mf 



It is well-known that for general a 3 /?, X^t^^t and X^^p are conjugate repre- 
sentations, and hence their dimensions are equal. Thus, 

s 

dim(i?„(a^)) = J| [ATa./^J \Xx^^i/^, \ = dim(i?„(a)) 

1=0 
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Furthermore, it is clear from the definition that C{a^) = — C(a). Lemma [5.51 
gives 



s— 1 s s — 1 



A(a^) = ^(^n - E ^(/^^) = - E ^(^^) + E ^(/^^) 

2—1 2 — 1 2—1 -i— 1 

= -A(a) 

as required. □ 
Now, assume that Ai(a) < 0. Then, 

, , n + 2A(a) -n + 2A(a'^) n + 2A(q;^) 

Ai(ck) = : — = : < : 

^ ^' n + 2A n + 2A " n + 2A 

= Ai(a^) 

Therefore, 

< (2 ^ dim(i?„(a))Ai(«)2* j (6.3) 

\ MAi(q)>0 / 

Hence, it will suffice to provide upper bounds for the eigenvalues. The argument 
follows the approach laid out by Diaconis and Shahshahani in their analysis for 
Sn [2]- For their walk, the eigenvalues were functions of partitions A, and it 
turned out that a good bound for these eigenvalues could be derived using only 
the largest part of A. A similar trick is used to bound Ai(a). 

As noted earlier in Remark 15.31 for h — &«(/, 3), there are only two left- 
equivalence and two right-equivalence classes. Furthermore, by Equations (|5.ip 
and (IE21), \Li\ = /(n), IL2I = n- f{n), and |i?i| = g{n), |i?2l = n-g(n). Thus, 
by Definition 15.41 a 6-partition a can be written as 

a = (Ai,pii, A2), where |Ai| = /(n), \ = f{n) ~ g{n), IA2I = n - g(n), 

(6.4) 

and Ai 3 /ii C A2 

Hence, by Lemma [5. 101 for a — (Ai,/ii, A2), 

A(a) = C(Ai)-C(Aii) + C(A2) (6.5) 

Since by Equation (|5.6I) . Ai(q:) is just {n + 2A(a))/(n + 2A), it suffices to find 
bounds on the above quantity. 

The largest parts of the partitions will be used to bound A(a). For sequences 
of partitions a — (Ai, ^1, A2), the bounds will depend on the sequence of largest 
parts of Ai, /ii and A2. In order to slightly simplify calculations, instead of 
working directly with the largest parts of a, it is convenient to work with what 
remains when the largest part has been taken away. The following definitions 
and lemmas carry out these ideas. 
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Definition 6.2. Let X be a partition of n. Then, write A as 

\^{\\\\\\...) 

where A^ > A^ > A^ . . . ; m particular, the largest part of X will be denoted by X^ . 
Futhermore, note that (A^, A^, . . . ) is a partition of n — X^ . Call this partition 
the remainder of X, and denote it by X^'^ . 

Now, let a be a b-partition, where a = (Ai,/ii, A2). Then, define 

a'^{Xl,f,l,Xl) 
a«^ = (Af^/.f^Af) 

analogously to above. Furthermore, define 

|a«^| = {fin) - XlJin) - gin) - A^}, n - gin) - A^) 

Note that 

\„Re\ n\Re\ \,,Re\ \\Re\\ 

The eigenvalue A (a) is bounded by finding a function of |q!^'^| which is 
greater than it. The following lemmas carry this out. 

Lemma 6.3. Let X be a partition of I and ^ be a partition of m such that A 3 /i. 
Assume that 

\X"''' \ = i and \ = j 
Then, for C(A) as defined in Defintion \5.9\. the following ineguality holds: 

CiX) - Cif,) < ^ - ^^^^ - ^(/ - z + 1) + Jim - j + 1) 

Furthermore, if j < m/2 and i < 1/2 then equality is achieved at X = H — i,i) 
and fi = (m - j, j). 

Proof: Let sqia, b) denote the square of the Ferrers diagram in column a and 
row b, with the square in the upper left corner denoted by sqil, 1). The content 
of a square x = sqia, 6) is a — b. This increases as the column a increases, and 
decreases as the row b increases. 

Note that C(A) - C(/i) = C(A//i), since A 3 ^. Since jA^'^j = i and = 
j, Xi = I — i and /ii = m — j. Thus, the first row of A//x contains the squares 
sqim — j + 1, 1), sg(m — j + 2, 1), . . . , sqil — i, 1). Furthermore, A//x contains 
another i — j squares in other rows. 

Now, start with the Ferrers diagram that consists only of the first row squares 
sqim — J + 1, 1), sqim — j + 2, 1), . . . , sg(/ — i, 1) and add squares until reaching 
a skew partition that contains I — m squares. Since |^^'^| = j, the first square 
added must be at most in column j + 1 and at least in row 2. By similar logic. 
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the rth square added must be at least in column j + r and at least in row 2. 
Thus, the content of the rth square we add is at most j + r — 2, so 

C(A//i) = C(squares in first row) + C(remaining squares) 

< ((m - j) + • • • + a - i - 1)) + ((j - f ) + • ■ • + (i - 2)) 

^('-■)-("vo-e^')-e^') 

P — I m? — TO , , 

= — i{l-i + l)+ji7n^j + l) 

as required. Furthermore, if j < to/2 and i < 1/2, then is straightforward to 
check that A = {I — i,i) and fi — (m — are partitions, satisfy A C /i, and 
achieve equality. □ 

Example 6.4. Let / — 9 and i — 4, and let to = 5 and j — 2. Then, A has 5 
squares in the first row, and n has 3. This A//Lt has 2 squares in the first row, 
and 2 more squares somewhere below the first row. Clearly, the first row of A/^ 
looks like 




where the dotted lines correspond to the squares of fi. Now, the remaining 2 
squares of A//i need to placed somewhere. By definition of content, C{\/ ^) is 
maximized by placing these squares as far up and as far to the right as possible. 
Clearly the best thing to do is to put all the remaining squares in the second 
row - it is easy to see that otherwise, a square can be moved up and to the right, 
increasing content. Furthermore, since /i has 2 squares below the first row, all 
of these should be put in the second row as well, to make sure that the squares 
of A//X are as far to the right as possible. Thus, the optimal arrangement is 



I 1 1 1 1 — 

I I I 

+ 1 

I I 

( I 1 1 1 

which is precisely the case A = (5, 4) and /i = (3, 2). Note that this arrangement 
would have been impossible if i or j was too large, but the same argument would 
have still provided an upper bound. 

Corollary 6.5. Let X be a partition of I and ^ he a partition of m such that 
A 3 /i. Then, 

Furthermore, equality is achieved at X = (I) and ~ (to). 

Proof: A proof identical to the one for Lemma 16.31 above works, but instead 
the lemma itself is used. Assume that |A''^'^| = i and l/i^'^l = j. Since X ^ fx, 
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it follows that i > j and > /i-'^, so I — i > m ~ j. Combining, i{l — i + 1) > 
jim — J + 1). Thus, 

C{\)-C{ii)<— — ^{l-^ + l)+J{m-J + l) 

P — I rn? — m 

< 

- 2 2 

as required. It is easy to check that A = (l) and ^ = (m) achieve equality. □ 

The following result appears in Diaconis (Lemma 2, Chapter 3) [2J. A proof 
is given. 

Lemma 6.6. Let X be a partition of I, and let C{X) be the sum of the contents 
of the squares in A, as defined above. Let |A^^| = i. Then, 

C(A)<^-za-z + l) 

for all i, with equality achieved for i < at X — {I — i,i). 
Furthermore, if i > 1/2, 

Proof: The first inequality follows easily from Lemma [6.31 Let /i = 0. Then, 
m = j — 0, and 

C(A)<^-z(/-z + l) 

as required. From the same lemma, if j < 1/2, equality is achieved at (/ — i, i). 
To prove the second inequality, note that if A = (A^, A^, . . . , A''), then 



s=l 

Clearly, A* < A^ for each s. Thus, 

s— 1 s— 1 

2 ^ 

s=l 

Now, A^ — 1 = / — i — 1, while ^ A* is the total number of squares in the partition 
A, which is equal to I. This gives 

C(A)< - "^"2 
as required. □ 
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The next two lemmas bound the eigenvalues Ai(a). The first lemma will 
be used when | A^*''^ | is sufficiently large, and the second lemma will be used the 
rest of the time. 

Lemma 6.7. Let a = (Ai, /ii, A2) be a b-partition, where b — bn{}, g), such that 
IA^"^! — k > {n — g{n))/b. Then, for sufficiently large n, 

Ma) < I 

Proof: From Equation (|6.5|) . 

A(a) =C(Ai)-C(/ii) + C(A2) 

From Equation (|5.6p . 

, , , n + 2A(a) 
n + 2A 

and from Lemma 12.101 

n + 2A - 2ng{n) + 2g{n)f{n) 

Since A2 is a partition of n — g{n), and |A|^'^| — k > {n ~ g{n))/5, Lemma 
gives 

^ (n - gjnjf - (n - g{n)) _ k{n - g{n)) ^ 2{n ^ g{n)f 
^ ^' - 2 2-5 



Further, from Corollary [ 

C(Ai) - C{^,,) < - (/(n)-gH)^-(/(n)-.g(n)) 

Since f{n)/n approaches 0, g{n) < f{n), and n + 2A — n'^ — 2ng{n) + 2g(n)f(n), 
the above expression for C(Ai) — ^(/ii) is negligible compared to n + 2A. Sim- 
ilarly, n/{n + 2A) approaches as n — 00, 

hnr Ma) = lim » + 2C(A,) - 2C(,0 + 2C(A2) 

n—^oo n— >-oo fl a A 

^ lim - ^ (?i - .g(n))2 



n-!-(X) 5 \n'^ — 2ng{n) + 2f{n)g{n) 
4 
5 



again using the fact that f{n) and g{n) are negligible compared to n. Thus, for 
sufficiently large n, Ai(a) < 9/10, completing the proof. □ 

Lemma 6.8. Let a — (Ai, /ii, A2) be a b-partition, where b — bn{f, g), such that 
\ = {i,j,k). Then, 

Ai{a) < 1 - si{i) + S2U) - S3{k) 
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where 

si{i) 



■ 2»(/(«)-i+l) . /(n) 

n+2A ' ^ 2 

'/(") „■ > /(") 

I, n+2A ' - 2 

2j(/(n)-g(n)-j + l) 
S2U) = 



S3(fc) = 



n + 2A 

2k{n- g(n) - fc + 1) 



n + 2A 

Proof: From Equation (|6.5p . 

A(a) = C(Ai)-C(/ii) + C(A2) 
Furthemore, from Lemma 16.61 

'--l-^l^ - \ f{nf-f{n) »/(«) ,->/M 
K 2 2 ' — 2 

Thus, from the definition of si(i), 

C(Ai) < si(i) 

Using Lemma 16.31 and simphfying, 

C{X2) - C(/ii) < A - ' ' ^^20) + -^—S3ik) 

using the fact that 2A = — 2ng{n) + 2f{n)g{n) — n. 
Thus, combining the two, 

A(«) = C(Ai) - C{fH) + C(A2) < A - ^^i^ - S2(j) + s^ik)) 

S\ / \ n-\-2A(a) 
mce Ai(a) = „+2A ' 

Ai(a)<l-si(«)+S2 0')-S3(j) 

as required. □ 



7 Lead Term Analysis and Chi- Squared Lower 
Bound 

The eigenvalue bounds derived above will now be used to provide some lead 
term analysis and also to prove the lower bound of Theorem 12. 121 This will use 
Lemmas 16.51 and 16.61 - Lemma 16.81 could also be used, but the previous lemmas 
are more hands on. 
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Let a — (Ai, fii, X2) be a 6-partition, where b = 6„(/, 5). Then, 

A(a) = C7(Ai)-C(/ii) + C7(A2), 

Lemmas 16. 51 and 16.61 show that C(Ai) — C(/ii) is maximized at Ai — {f{n)) and 
fii — {f{n) ~ g{n)), whereas C(A2) is maximized at (n — g{n)). Thus, letting 

"0 = ((/("-)), (/N - 9{n)), {n - g{n))), 

A(ao) must be the maximal eigenvalue of U. Indeed, doing the calculation gives 

^^^■^ ^ - n - 2ng{n) + f{n)g{n) ^ ^ 

which is precisely the expected maximal eigenvalue of an adjacency matrix of 
a regular graph with degree A. Intuitively, the next highest eigenvalues should 
be A(a) for a close to ao. If a = (Ai,^i, A2) ^ ao, clearly either Ai ^ {f{n)) or 
^2 7^ {n — g(n))- Thus, the obvious candidates for next highest eigenvalue are 

ai = ((/(n) - 1, 1), (/(n) - g{n)), (n - g{n))) and 
a2 = ((/H), (/H - gin)), (n - g{n) - 1, 1)) 

Indeed, if a = (Ai, /ii, A2) and Ai =/= (/(ti)), then from Lemmas 16. 61 and 16. 51 it is 
easy to conclude that 

A{a)^C{Xi)-C{pii) + C{X2) 

< ^^"^'^'^^"^ - fin) + (C(A2) - 

< Cifin)) - fin) - Cifin) - gin)) + C(n - gin)) 

< A(ai) 

and similarly, if A2 ^ (n — gin)), 

A(«) < Cifin)) - Cifin) - gin)) + C(n - gin)) -in- gin)) 
< A(a2) 

The above arguments (or a straighforward calculation) should make it clear that 

A(ai) = A - and A(a2) = A - (n - .g(n)) (7.1) 

The following steps derive the bounds corresponding to ai and a2 for the lead- 
term analysis. 

Bound corresponding to ai: 

Here, the term dim(i?„(ai))Ai(ai)^* is used, where 

«i - iifin) - 1, 1), if in) - gin)), in - gin))) (7.2) 
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as defined above. Now, from Equation 15. 3[ 

dim(i?„(ai)) = |XaJ |^Ai/pi| l-'^Aa/^il \Xx^\ 



As noted in Remark l5.61 the degree of a skew representation is just the number 
of standard Young tableaux of that skew shape. Thus, 

since choosing a standard Young tableau for {f{n) — 1, 1) just requires picking 
a number other than 1 for the second row - the numbers in the first row have 
to be ordered, and and picking 1 to go in the second row would result in a 
contradiction in the first column. 

Now, Ai//ii = {f{n) — 1, l)/(/(n) — which consists of precisely g{n) — 1 
squares starting at /(n) — g{n) + 1 in the first row, and one square in the second 
row. Since the square in the second row is not directly below any square in the 
first row, choosing a standard Young tableau just requires picking any number 
for the second row. Thus, 

Since A2//^i and A2 are both just single rows. 
Combining, 

dini(i?„(ai)) = (/(n) - l)g(n). 



A (r. \ ri + 2K{ai) 2f{n) 



From Equation (|7.1 



2A 71 + 2A 

and hence 

dim(i?„(ai))Ai(ai)^* = (/(n) - l)g{n) (^1 - ) (7.3) 

4tf{n 



ifin) - l)g("-) exp 



Thus, making this lead term at most e^^ for some constant c requires 
Atfin) 



n + 2A 

so it clearly suffices to have 



>log(/(n)-l)+log(7(n) + c 



(n + 2A)(log/(n) + logg(n)) , n + 2A 
Thus, the above bound is the contribution of ai. Turn next to a2- 
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Bound corresponding to a2- 

The case of 

«2 = ((/(n)), (/(n) - g{n)), {n - g{n), 1)) 
is entirely analogous. Identical calculations show that 

dini(i?„(a2)) = (n - g{n) - l)(n - /(n)) 

and that 



1 2(n - g{n)) 
Ai(a2) = l- ^^2A 



Hence, 



dim(i?„(a2))A2(ai)^* « (n - ^(n) - l)(n - /(n)) exp 



giving the bound 



4t(n - g(n)) 
n + 2A 



(n + 2A)(log(n - /(n)) + log(n - g{n))) n + 2A 

4(n-ff(n)) '^''A{n-g{n)) ^' 

To compare the bounds, first prove the following simple lemma: 

Lemma 7.1. // (x„) and (y„) are positive sequences such that lim„_^oo ^ = 0, 
and lim„^oo = oo, then for an arbitrarily large constant c, 

logXn logy„ 
> c- 



Vn 

for sufficiently large n. 
Proof: Rewriting, 

log?/n _ log(2/„/a;„) + log 

= — log — + loga;„ 

Xn\yn KVnJ Vn 



Since |^ approaches 0, and Xn approaches cx), — |^ log \jf"j approaches and 
thus is eventually less than ^ loga;„. Similarly, |^ loga;„ is eventually less than 
^ loga;„. Combining, for sufficiently large n, 

as required. □ 
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One of the assumptions in Theorem [TT^] is that f{n)/n approaches 0. Since 
by definition (7(n) < f{n), the bound in Equation (|7.5p is of order (n+2A) logn/n, 
whereas the bound in Equation (|7.4p is of order {n + 2A) log /(n)//(n). The 
above lemma with Xn — f(n) and i/n = n makes it clear that Equation (|7.4p is 
the stronger bound. This lead term analysis suggests chi-squared cutoff around 

^ ^ (n + 2A)(log/(n) + logg(n)) 
4/(n) 

with a window of order '".t?'^ ■ 

4/(n) 

Having derived the second highest eigenvalue, the lower bound part of The- 
orem [2ll2l can be proved. If t is defined as 

(n + 2A)(log/(n)+logg(n)) n + 2A 

4/(n) Ufin) ^'■'> 

it is shown that 

for sufficiently large n. 

Proof of Lower Bound in Theorem [2TT2t Let ai = {{f{n) - 1, 1), (/(n) - 
g{n)), {n ~ g{n))) as in Equation (|7.2p . From Equation (|6.ip . 



|P*(x, •) - ttII^^^ = dim(i?„(a))Ai(a; 



2t 



> Vdim(i?n(ai))Ai(ai)2* 

Furthermore, from Equation (|7.3p . 

dim(i?„(ai))Ai(ai)^* = (/(n) - l)g(n) fl - ' 

\ 71 + 2A . 

Since — )• 0, — > as n approaches oo. Thus, for t as defined in Part 2 
of Theorem 12. 121 and as restated above in Equation (|7.7p . 

hm (/(n) - l)g(n) fl - ^^V' = lim (/(n) - l)g(r.)exp f-^^^^^^ 



n-^oo y n + 2A / n->co y n + 2A^ 

= lim (/(n) - l).g(n)e~'°s^(")-'°S9(")+" 

n— >-oo 

= hm — „, , = 

since f{n) oo. Combining the equations above, 

•) - Trjl^ ^ > \/dim(i?„(ai))Ai(ai)2t — > e^^^^ as n ^ oo 

c/2 

and therefore, for sufficiently large n, the chi-squared distance is at least 
This proves the lower bound. □ 
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Remark 7.2. Note that while the assumptions of Theorem 12 . 121 are used in the 
above proof to simplify computation, their full strength is not needed. Indeed, a 
similar lower bound can be derived for almost any functions / and g. However, if 
n — g(n) < f{n) then this lower bound will not correspond to the second-highest 
eigenvalue, as that will be associated to the a2 defined above. 



8 Dimensions of Eigenspaces 

This section continues with the proof of the the upper bound in Theorem 12.121 
Here is a recap of the bounds in Section [51 

Definition 8.1. Let {i,j,k) be a triple of integers. Then, define 

I 1 — Si(ij + S2U j ~ S3(k) ottiermse 
where Si, S2 o-nd S3 are defined as in Lemma \6.8\ - that is, 



( 2»(/(»)-i+l) . /(«) 

- \ ^f{n) . . f{n) 

{ n+2A ' - 2 

2j(/(n)-ff(n)-j + l) 
S2U) ^ 



S3(fc) 



n + 2A 

2k{n - g{n) - fc + 1) 
n + 2A 



Remark 8.2. The above s{i,j,k) are chosen to simplify notation. Combining 
Lemmas 16.71 and 16. 8[ 

Ai(a) < s(|a^^"|) (8.2) 

which will clearly be useful. 

Here is a sketch out the rest of the proof. By Equation (|6.3p , the quantity 
to be bounded is 

J2 dim(i?„(a))Ai(a)2* 

1/Ai(a)>0 

Since the upper bound above depends purely on | a^*^ | , rearrange the above 
quantity by the value of | a^'^ | . From the previous section, it is clear that 
|q!^^| = (0,0,0) corresponds to Ai(q;) — 1. Thus, 

dim(i?„(a))Ai(a)2* < ^ dim(i?„(«))s (|a«"^|)'* 

1#Ai(q)>0 la^" 15^(0,0,0) 

= K^'J'fc)" E dim(i?„(a)) 

(8.3) 
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The proof below will be organized as follows. The current section finds an 
expression for the sum 

dim(i?„(a)) 

in terms of and k. This leaves the (rather unwieldy) sum on the right-hand 
side of Equation (|8.3p in terms of the three indices and k. In Section [HI 
this sum is split into a number of pieces that depend on the precise values of 
the indices, and supporting lemmas are proved for the size of each piece. All 
the bounds are then combined into a proof of the upper bound part of Theorem 

The current section is devoted to proving the following lemma: 
Lemma 8.3. Assume that i > j < k. Then, 



E 



|Q«=| = (ij-,fc) 



.M«„,«)).,^<"')(f5;'].)(".-^r)(""r')f 



Remark 8.4. Note that \a 
•^1 ^ Ml ^ -^2- Thus, if 



I is defined to be (| Xf '' | , | ^f '^ | , | A|*''^ | ) , where 



a 



then i> j < k. Therefore, the the condition in Lemma 18.31 is the natural one. 
Definition 8.5. To simplify notation from now on, define 



a{i,j,k) 



f{n)\ f 9{n)\ fn- f{n)\ /n - g{n) 



The n in the above expression will always be implied. 

A number of supporting lemmas will eventually yield Lemma 



above. 



Lemma 8.6. Let a = (Ai, /ii, A2) be a b-partition, where b = bn{f, g), satisfying 



v-Re I 



{i,j,k). Then, 
dim(i?„(a)) < a{i,j,k) X^^r 



X\Re / ,,Re 
^2 /Ml 



(8.4) 



Proof: From Equation 



in Theorem E 



dim(_R„(Q;)) 



I^Ail I^Ai/^il \X\2 



\X 



Furthermore, as noted in Remark 15.61 is the number of standard 

Young tableaux of shape a//3. Use this to examine the expression above. 
Clearly, \Xx^ \ is the number of standard Young tableaux of shape Ai. A naive 
way of trying to construct such a tableau would be to pick the f{n) — i elements 
of {1, 2, ... , /(n)} that will go in the first row of the standard Young tableau of 
shape Ai, and then use the remaining i elements to construct a 'shifted' standard 
Young tableau of the remainder of Ai, called A{^^ above. Note that once the 
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elements of the first row arc chosen, they must be arranged in exactly increas- 
ing order: thus, each choice of subset and of remaining 'shifted' tableau results 
in exactly one Young tableau. While this will vastly overcount the number of 
standard Young tableaux of shape Ai, every single one can be constructed in 
this way. Thus, 



fin) 
f{n)-i 



fin 



Similarly, to construct a skew tableau of shape Xi/ Hi, pick the g{n) — — j) 
elements for the first row, and construct a 'shifted' tableau of shape Af^//if^ 
with the remaining numbers. Thus, 



\X 



Ai/jUi 



< 



gin) 



XyR. 



Proceeding in this way. 



X 



< 



I^A.|< 



n - fin) 

k-j 
n - g{n) 
k 



Af'/^'f 



Multiplying the above inequalitites together gives precisely 



dim(i?„(a)) < a{i,j,k) 

as desired. 

Using the lemma above, 

y~] dim(i?„(a)) 

|aRe| = (i,j,fc) 

<a{i,j,k) ^ 



X\Re 



AfV,^f« 



X^Re 



^AfV^f" 



X^Re 



In order to simplify things slightly, the following simple lemma is useful: 
Lemma 8.7. For any k, 



E 

|aRe|=(i,j,fe) 



^Af= 



X\R~ 



\X),Rb 



(A'l.K.Ai) 



□ 



where the right-hand sum is over triples of partitions such that \X[\ = i, = 
j, IA2I = k, and X[ D iJ.[ C A2. 
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Proof: Clearly, if \a^''\ = {i,j,k), then 

lAf'^l = i, = j, \X^^\ = k, and Af" D /xf« C A^^ 

Furthermore, given b = bn{f, g), Oi^^ uniquely determines a (although not every 
choice of (A^, /i'l, A2) will actually produce an a). Hence, the lemma follows 

trivially. □ 

Now, note that the sum on the right-hand side of the above lemma looks 
a lot like a sum of dim(i?„(a')) over a-partitions a' for some a. Using this 
heuristic, consider rewriting the above as 

^dim(ii„(a')) = \Sm(S)\ 

where the equality follows since the sum of the dimensions of the cigcnspaces is 
just the dimension of the whole space. 

However, the above heuristic has the following trouble: if cither i = j or 
2 = k, then the (A'j^, /z'j^, Aj) above are not in fact a-partitions for any a, since 
\'i = /z'l or ji'i = A2 are forbidden. However, this turns out to be a non-essential 
part of the definition of a-partitions. The following supporting lemma overcomes 
the issue. 

Lemma 8.8. Let n > m, and let X be a partition of n. Then, the following 
equality holds 

jU<A,|/i|— m 

Proof: This may well be a well-known formula. However, it has a simple combi- 
natorial proof presented below. As noted above, \X\ \ is the number of standard 
Young tableaux of shape A, and similarly for fi and X/fi. Let Tab{s) denote 
the set of standard Young tableaux of shape s, whether s is a partition or a 
skew-partition. Then, what is needed is a bijection g such that 

g : U Tab{n) x Tab{X/n) — ^ Ta6(A) (8.5) 

\n\=m 

where the union is over partitions /i. 

To define the bijection note that if Y\ is a standard Young tableau of 
shape A and Y\/^ is a Young tableau of shape A//x, a Young tableau of shape 
A can be created by adding m to every entry of the Y^/f^ and then sticking the 
two tableaux together. Call this new tableau giYfj^^Y^/^). 

For example, if A = (4,2,1), and ^ = (3,1), then the following Young 
tableaux 





1 


3 


4 




2 





can be combined into ^(F^, Y^/ij,), which is 
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1 


3 


4 


5 




2 


7 








6 









Clearly, for any G Tab{^) and Ia/^ G Tab{X/fi), the above bijection 
results in a tableau of shape A which is filled with the numbers {1, 2, . . . , n}. 
Thus, it remains to check that the rows and columns in g{Y^,Y\/f^) are in 
increasing order from left to right and from top to bottom. 

Proceed by contradiction: assume there are i and j such that i < j, but j 
is strictly to the left of i, or strictly above i in g{Yfj_,Yx/f^). If i and j are both 
at most m, then they both appeared in Y^, and this is impossible; similarly, if 
both i and j are greater than m, then i — m and j ~ m both appeared in ^a/zx 
and this is similarly impossible. Thus, assume that i < m < j. But then j must 
be in a square belonging to A//i, and i must be in a square belonging to /i, and 
therefore it is impossible for j to be strictly to the left or strictly above i. Thus, 
g{Yf^, Yx/^) is a standard Young tableau, so the map g is well-defined. 

Now for the inverse / of the map g. It is easy to see from above that fi is 
defined precisely by the set of squares which contain the numbers {1, 2, . . . , to}. 
Using arguments identical to the above, in any standard Young tableau Y\ the 
set of squares containing the elements {1, 2, . . . , to} is a partition of to. Thus, 
the inverse map / must map Yx to (V^, Yx/^): where Y^ is simply the standard 
Young tableaux induced by the squares containing {l,2,...,m}, and Y^/^ is 
obtained by deleting the squares in n from Yx, and subtracting m from the 
remaining squares. For example, if 



Yx^ 


1 


2 


5 


6 




3 


4 








7 









then the inverse map / maps it to the pair 



Y,^ 


1 


2 


Yx/,^ 


1 


2 




3 


4 









3 



and hence the bijection has an explicit inverse. This shows that the sizes of the 
sets in Equation (|8.5|) are equal, and hence 

/^<A,|/^|— m 

as required. □ 
The next lemmas tackle the expression on the right-hand side of Lemma ISTZl 
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Lemma 8.9. Let {i,j,k) be a triple of positive integers that satisifes i> j < k. 
Define 

^= E \^K\\Xyj,'MX,y,'MXy^\ (8.6) 

where the sum is over triples of partitions {X[, fi[, X2) such that \X[\ = i, \^[\ = 
J, IA2I ^ k and X'l D fi'^ C A2. Then, 

i\k\ 



A 



Proof: U i > j < k, define the vector a to be 

a^{l,l,...,l,i-j + l,...,i-j + l) 

where the number of initial Is is i, and the total number of terms in the vector 
is i — j + k. Then it is easy to check that the set of all {X'l, fi'i, X'2) such that 
\X'i\ — i, = j, IA2I = k and X'l D C X'2 is the set of a-partitions, as defined 
in Definition El Thus, 



^ - E I I I ^Ai /pi I I ^A^ /m'i I I ^A^ I 

(A2,/J-^,A2) a-partition 

J2 dim(i?„(A'i,//l,A^)) 

(A'^j/^^jAg) a-partition 

where the second equality follows from Equation (|5.3p in Theorem 15.81 But 
from the same theorem, 

a a-partition 

Thus, 

^ dim(iT!„(a)) = dim €SM{a) = |<5'A/(a)| 

Q a-partition 

But for the choice of a, from Lemma [321 



I ._i\k\ 

PM(a)| - — ]- 

finally yielding that 

i\k\ 



A= ^ dim{Rn{a)) = \SM(a 



a a-paititioii 



as required. 

It remains to explain how to handle the case where i — j, or j — k, or both. 
Consider the case i = j < k (the same method will apply to all the other cases.) 
As before, 

^ = E l^^i I I l^K/t^i I I ^^2 I 
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where the sum is over \X'i\ = i, = j, jA^ — k and \[ I) fi[ C' A2. Since i = j, 
and X'l D /i'j^, it must be that X'l = ii'i. Thus, 

where the sum is over \X'i\ — i, IA2I = fc, and X'l C A2. But from Lemma 
for a fixed A2, 

A;cA^,|A;|=i 

Thus, 

|Ay = fe 
|Ai|=fc 

and the sum on the right is well-known to be kl. Therefore, in this case, 

ilkl 



A = k\ 



J 



since i = j- The other cases with equality can be done similarly, completing the 
proof. □ 

With all these preliminaries, the lemma from the beginning of the section 
can now be proved. 

Proof of Lemma 18.31 From Lemma 18.61 
dim(i?„(a)) 

|c«H«| = (i,j,fe) 

<a{ij,k) ^ X^R. X^ny^Rc X^R./^R. X^R. 

Combining this with Lemma 18.71 

dim(i?„(a)) < a{i,j,k) ^ \Xx[ \ |^a;/m'J l^x'^/i^'^l I^kI 

\a'i''\ = {t,j,k) (A'i,Ai'i,A^) 

where the right-hand sum is over triples of partitions such that jA'^j — i, \ fi'i\ = 
j, IA2I = k, and X'l D ji'i C A2. However, from Lemma [8^ 



m 
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and thus 

22 dim(i?„(a)) < a(i,j, fc)— p 

|a««| = (ij,fe) 

as required. □ 



9 Chi-Squared Upper Bound 

This final section puts all the quantities together to prove the upper bound part 
of Theorem 12.121 For the remainder of this section, let t be defined the way it 
is for the upper bound; that is, 

(n + 2A)(log /(n) + log g(n)) , n + 2A 



From Equation (|8.3|) . 

^ dim(i?„(a))Ai(a)2* < ^ s{t,j,k)^' ^ dim(i?„(a)) 

15^Ai(q)>0 (i,j,fc)#(0.0,0) |Q«'=| = (i.j,fc) 

(9.2) 

where s{i,j,k) is defined as in Definition 18.11 From the lead-term analysis in 
Section [71 the 'limiting' eigenvalue corresponds to the case {i,j,k) = (1,0,0). 
This suggests that term is the largest. 

The above sum will be broken up into various pieces and bounds will be 
proved for each piece. There are three zones: 

1. The first zone is k > {n — g{n))/5. As should be clear from the definition of 
s(i, J, fc), this is the zone with a constant upper bound for the eigenvalues 
Ai{a). This case is fairly straightforward, and will be done in Lemma [921 
below. 

2. The second zone is i = and fc < (?i — g(n))/5. For this case. Lemma 
18.31 is used for the upper bound. From the heuristics in the lead term 
analysis, the expected limiting term for this piece is (0, 0, 1) - since it was 
noted that this term imposes lower order restrictions than (1,0,0), this 
case should also be fairly simple. This will be done in Lemma [ 



3. The final zone is i > and k < {n — g{n))/5, for which Lemma 1531 is again 
used. This is the case that contains the limiting term (1,0,0), and as such 
should provide the biggest bound. This will be done in Lemma [ 

To simplify notation, make the following definition: 
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Definition 9.1. Define 

f / ■ N 1 , 11 — n(n) 



A2 = I k) + (0, 0, 0) I k < " = 



A3 = |(^,J■ fc)|fc<^^^,*y^O 

This corresponds to the pieces being bounded. 
Lemma 9.2. Using the definition of Ai above, let 

Qi= E s(^J.fc)'* E dim(i?„(a)) 

(i,j,k)eAi \aR'\ = {i,j,k) 

Then, for sufficiently large n, Qi < 

Proof: By Definition [HI for {i,j,k) e Ai, s{i,j,k) = 9/10. Thus, 

E (Ti^T E dim(i?„(a)) 



,10, 

(ij',fe)eAi |a-'*=| = (i,j,A:) 



< 



< 



10, 
10 



dini( j?n(a)) 



since dim(i?„(a)) = ^a/I^) 
ipTTjl above, 



< n\. Then, using t as defined as in Equation 



Q X n'(log/(n)+logg(n))/2/(n) 



«"(log/(n)+logg(n))/2/(n) 

10, 



< ( ^ I 



n log n 



From Lemma [7.11 for sufficiently large n, the above is at most e~"'°s" — n"", 
completing the proof. □ 

Before continuing, a simple supporting lemma is needed. 

Lemma 9.3. With notation as above, 

i\kl f{nyg{ny-^{n - f{n)f-^{n - g{n)f 
-TT-a[i,j,k) < — -— 
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Proof: This follows easily from the definition of a{i,j, k) in Definition 18.51 and 
the fact that H < ^ . □ 



Lemma 9.4. For 



A2 = \ ii,j,k) ^ (0,0,0) I k < " f^"\ » = 



let 

Q2- E dini(i?„(a)). 

(ij\fc)eA2 |Q^'=| = (ij',fc) 

Then, for sufficiently large n, Q2 < 2(n — g(n))^^. 



Proof: By Definition [811 

s{i,j,k) = 1 - si{i) + S2{j) - S3{k) 

since 1 — x < for all x. Furthermore, from Lemma 



22 dim(i?„(Q;)) < a{i,j, k)—y- 

|a«<=| = (jj,fc) ^' 



for a(i,j, k) as in Definition 18.51 Combining the above, 

Q2< e-2*-i«+2*^^W-2*^^«a(i,j,fc)^ (9.3) 

(ij,fc)eA2 

Now, for {i,j,k) G A2, i = 0. Since i > j, this also means that j ~ 0. Using 
Definition 18. f I and Lemma 19.31 



si(i) = 0,S2(j)-0and— a(i,j,fc)< ^ ' " ^ 



j! fc! 

Thus, the above inequality simplifies to 

^ ( ^tk{n~g{n)-k^\)\{n-j(n)f{n^g{n)f 

i<k< "-?'"'> 



n + 2A / fc! 



and hence, using the fact that g{n) < f{n), 

4:tk{n - g{n) - fc + f ) \ (n - g{n)Y'' 



Q2< Y 



l<k< 



< 



E 



exp 



exp 



i<fe< 



2A J fc! 

mtk{n~g{n))\ {n - g{n) f^ 



5(n + 2A) y fc! 
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Now, using Lemma 1 7 . 1 1 with x„ — f{n) and y„ = n — g{n), for sufficiently large 

n+2A - 4/(n) - n-g(n) ' -L^^U^'j 

Q2< X! exp(-8fclog(n-3(n))) 



(n - 5(n))2'= 



i<fc<- 



-g('i) 



and hence the above simplifies to 



Q2< E ^ <2(n-g(n)) 



i<fc< 



5 



as required. □ 

It is easy to see that the lemmas above can be manipulated to provide 
arbitrarily good bounds. As noted previously, this is because the 'limiting' term 
(1, 0, 0) does not make an appearance here. The case i will be considerably 
more tricky (or at least more tedious.) 



Lemma 9.5. For 



let 

Q3= E E dim(i?„(a)) 

('i,j,fe)GA2 |Q-R=| = (ij,fc) 

Then, for sufficiently large n, 



Proof: Manipulating as in Lemma 19.41 gives an equation analogous to Equation 
([9?3| above: 

(ij,fc)Gyl3 



Do the above sum over fc, then over j, then finally over i. Using Lemma 19.31 

(*-j)!(fc-j)!j! 



^ ^-2ts,(i)+2ts,(,)-2ts,(k) f{nTginy ^{n-f{n) f - g{n)f 



{iJ,k)GA3 

^ V ~2ts, (i)+2ts,(,) f{nyg{nf-' 

(9.4) 

Fix z and j and do the sum over fc: 
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Summing over k: If (i,j,k) G A^, then by definition k < {n — g{n))/5. Also, 
k>j, giving 

Denote the fcth term of the above sum by rk- The idea will be to show that 
fk+i/fk is less than 1/2, and thus to bound the sum by 2rj. Explicitly, 

rk+i ^ ^-2tss(k+i)+2ts,(k) (" - /("))(" - gW) 
rk k- j + 1 

Mi(2fc -n + g{n)) \ {n ~ g{n)f 
n + 2A / k- j + 1 

--(^^gST) <"-■""»' 

and again using Lemma 1 7. II with x„ — f{n) and yn — n — g{n), for sufficiently 
large n, > > 5 '°g^"7^"» . and thus 

o ' n+2A — 4/(n) — n — g{7i) 

^''^^ < exp(-121og(n-5(n))) (n - g(n))^ 



rk 



(n-g(n))-i°<i 



for sufficiently large n. Therefore, 

^ e-2ts3(/c) {n-g{n)) ^ ^ ^ 

Plugging this back into Equation (|9.4p . 

< 2 e-2*«^W finy e'ts,U)-2ts,U) g(")''^'["^-^ g^"))' (9.5) 

again using the fact that if |q!^^| = fc), then i > j, and also z < f{n). Now 
fix i, and do the sum over j. 



Summing over j: The sum to be bounded is 

i 

E 



2ts, if)-2ts, if) gjny ^(n-ff(n)y 
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From Definition 18. 11 

2j{n - f{n)) 
S2{j)-ss{j)^ 

Hence the sum simplifies to 

4tjin-fin))\ g{ny-i{n-g{n)y 



J=0 \ T / 



By a slight abuse of notation, let rj again be the jth summand of the above 
sum. As above, bound the ratio between rj+i and rj to bound the sum. Here, 

rj+i ^ ( 4t{n~f{n)) \ (n - g{ji)){i - j) 
r, ^\ n + 2A y g{n){] + l) 

( it(n- f(n))\ , , ^^ ^ 

- ""^^ \ n + 2A ) ^" " ^^("))^(") 



Now, using Lemma |7. II with Xn = /{n) and y„ = n — f{n), for sufficiently large 

„ t \ log/(") \ log("-/(")) TVinc 
n+2A - 4/(n) - n-f{n) ' -L^^Llh, 

= exp (-4 log(n - /(n))) (n - g{n))f{n) 

(n-g(n))/(n) 1 
- (n-/H)4 ^2 

for sufficiently large n, and as before. 

Plugging this back into Equation (|9.5p , 

Q3<4y e-^*-^^^) ^^"^y^^' . (9.6) 



1=1 



Summing over i: Break up the above sum into two pieces: i < and 
i > Bound the first case first. Consider the summation 

5 



E 

Recall that 



e 

i=l 



■ 2i(f(n)-i+l) ■ ^ f(n 



^i«= (9.7) 



n+2A * - 2 
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Thus, fori < 

2^^^^^^ ^ 2 / (n + 2A)(log/(n)+logff(n)) ^ + 2A^^ 2i{f{n) + 



4/(n) 4/(n) / n + 2A 

= (log/(n) + log5f(n) + c) i - — — 

V J[n) 

Now, i — is a quadratic function in i which corresponds to an upside down 
parabola, and as such is minimized at the endpoints of an interval. Plugging in 
i = 1 and i = for 1 < i < i-^>l. This gives that 



Thus, 



Hence, 



2tsi{i) > (log/(n) +logff(n)) ( i - ] +c 



e-2t.i« < exp (^-(log/(n) + log5(n)) (^i - 
= f{n)-'g{n)-'f{n)7(^g{n)T^e-' 

/(n)/5 .i , ,i /(")/5 



S=l 1=1 

By yet another slight abuse of notation, let r, be the ith summaiid of the above 
right-hand sum - bound the ratio between consecutive terms to find bounds on 
the sum. Then, since g{n) < f{n), 

Now, consider the above expression on the right-hand side as a function of 
i. By differentiating, it is easy to check that it is increasing for i -|- 1 > 4iog 

and decreasing for i + 1 < 4 log /(„) ■ Thus, in order to find its maximum, just 
check the endpoints. Plugging i = 1, the result is ^f{n)^/^^"K Since f{n) — >■ 00, 
this approaches 1/2 as n — >■ 00. Thus, for sufficiently large n, this is at most 
2/3. Plugging in i = /(n)/5, 

/(n)W<5/(n)-V5 



/(n)/5 + 1 



which clearly goes to 0, and hence is less than 2/3 for sufficiently large n. 
Therefore, rj+i/r^ is less than 2/3 for suflaciently large n, and hence 

/(«)/5 I 

J2 f{n)7T^g{n)-si^-= ^ r, < 3ri = 3 
1=1 *■ 1=1 
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and therefore 

/(»)/5 



Y: e-^*-^W ^(")y")' <3e- (9.8) 



I 

i=l 



It remains to bound the sum for i > f{n)/h. This is 

/(«) 



E 



i=/(n)/5+l 

It is easy to check from the restated definition in Equation (|9.7p that for i > , 

Sl U > — -r-r 
n + 2A 

Thus, 



exp 



/ (n + 2A)(log/(n)+logg(7i)) (7i + 2A) \ \ 
I 4/(n) 4/(n) n + 2Ay 

i(log/(n) +logg(n) + c)^ 



2 

Therefore, since g{n) < f{n), 



i=/(n)/5+l j=/(„)/5+l 



i=/(«)/5+l 



Combining the above with Equation ()9.8p . 



and thus from Equation (j9.6 



71 



?3 < 12e-'= + 4e-^+-''(") 



as required. □ 
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Proof of Upper Bound in Theorem I2.12i Combining Equation (|9.2p with 
Lemmas 19:21 [j^ and [931 



dim(i?„(a))Ai(a)2* < s{t,j,kf' ^ dim(i?„(a)) 

15^Ai(q)>0 (ij,k)=i{0,0,0} \aR'\ = {i,j,k) 

< n"" + 2{n - g{n)y^ + Ue'" + 4e~'T5^+-^(") 
Thus, for c > 10 and n sufficiently large, 

J2 dim(i?„(a))Ai(a)2* < 166-^= 

15:^Ai(q)>0 

Now, from Equation (|6.ip . 



•) - 42.. - /^dim(i?„(a))Ai(a)2t 
y a 

and thus, for c > 10 and for n sufficiently large, 

||P*(x, •) -ttIL <Ae~i 

II ^ ^ 1 1 z,7r 

as required. □ 

10 Open Questions 

As noted in the introduction, there are many questions raised by this paper 
which could be profitably explored. The most approachable one concerns the 
conditions in Theorem l2.12[ which assumes that 

/(") 

lim = and lim f{n) = oo 

n— ^oo fi n— ^oo 

It is not difficult to show that the second condition about f{n) approaching 
infinity is necessary (this fact is first noted in Remark |3. II) . Indeed, as Section 
[3| demonstrates, the first f{n) rows are the limiting factors behind the mixing 
time. Furthermore, the highest eigenvalue calculations in Section [7| also show 
that f{n) drives the highest eigenvalue. For cutoff to occur, the driving force 
behind mixing needs to grow without bound, explaining why the condition is 
necessary. 

However, the assumption that approaches does not have a similarly 
natural justification. This assumption was largely made to make the calculations 
more tractable. Indeed, tracing back through the proof shows that an identical 
argument could be made using only the assumption that 

fin) 
lim sup < c 



51 



where c is some specific constant. While I haven't calculate precisely what it 
would need to be, something on the order of 0.1 would suffice. 

Conversely, it's clear from the heuristics and the eigenvalue calculations that 
in order for the logic in this paper to be valid, f{n) needs to be no bigger than 
n — g{n). Otherwise, the highest eigenvalue is the one associated to n — g{n), 
and the mixing is driven by the last n — g{n) columns instead of the first /(n) 
rows. Thus, the condition that in the limit, 

f{n) +g{n) < n 

is necessary. This leads to the first question: 

Question 10.1. How much can the conditions in Theorem 12.121 be relaxed? If 
the assumption is that 

lim sup < c 

n— f oo ^ 

what value of c would allow a virtually identical proof to go through? What 
are the strongest conditions on f{n) (or possibly f{n) and g{n) both) that still 
allow for the same chi-squared cutoff? 

Continuing to pose questions, part of the focus of this paper is on the dis- 
crepancy between chi-squared and total variation mixing for two-step restriction 
matrices. Indeed, Theorem 12. 131 states that as long as f{n) and g{n) are com- 
measurable in the limit, then the two mixing times coincide (and both exhibit 
cutoff); whereas Theorem 12.141 states that for a wide class of /(n), as long as 
g{n) = 1, total variation mixing occurs substantially earlier and without cutoff. 
The same theorem would also go through without major adjustments under the 
assumptions that g{n) < c for any constant c; however, this is unlikely to be the 
only case where something similar happens. This leads to a number of natural 
and interesting questions: 

Question 10.2. What conditions on /(n) and g{n) are necessary so that total 
variation mixing occurs substantially before chi-squared mixing? Are those 
precisely the cases when the total variation mixing time doesn't undergo cutoff? 
If those are distinct phenomena, under which conditions on f{n) and g{n) does 
the random walk undergo total variation cutoff? 

A considerably more ambitious avenue of research involves studying a broader 
class of one-sided restriction matrices. In this paper, attention was focused on 
two-step restriction matrices because they induce a vertex transitive random 
transposition walk. However, I would conjecture that very few other restriction 
matrices have this property; see Remark 14.21 for a demonstration of what can 
happen. (For an example of a one-sided restriction matrix which isn't in this 
class but does induce a vertex transitive walk, see Example 3 in Hanlon [7].) 
This assumption made possible the use of Corollary 12.21 which requires only 
eigenvalue and not eigenvector information to achieve the requisite chi-squared 
bounds. 
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However, Hanlon does derive both eigenvalue and eigenvector information 
in his paper. While the calculations would be more difficult, it would certainly 
be feasible to apply Theorem 12.11 in the case of a general one-sided restriction 
matrix. This means that it would be possible to tackle questions similar to the 
ones in this paper for many more types of one-sided restriction matrices. Some 
of the questions which arise naturally are the following: 

Question 10.3. Is there a wider class of one-sided restriction matrices for which 
the random transposition walk can be proven to undergo chi-squared cutoff, and 
where does this cutoff occur? What kind of conditions on the restriction matrix 
ensure that the chi-squared and total variation mixing times match? Is there 
a wide class of examples for which the total variation time is an order smaller 
than the chi-squared mixing time? 

Although this brief summary is a good start, many analogous questions on 
the random transposition walk on one-sided restriction matrices can be posed. 
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